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Abstract 

Time-Division Duplexing (TDD) allows to estimate the downlink channels for an arbitrarily large 
number of base station antennas from a finite number of orthogonal pilot signals in the uplink, by 
exploiting channel reciprocity. Therefore, while the number of users per cell served in any time-frequency 
channel coherence block is necessarily limited by the number of pilot sequence dimensions available, the 
number of base station antennas can be made as large as desired. Based on this observation, a recently 
proposed very simple "Massive MIMO" scheme was shown to achieve unprecedented spectral efficiency 
in realistic conditions of user spatial distribution, distance-dependent pathloss and channel coherence 
time and bandwidth. 

The main focus and contribution of this paper is a novel network-MIMO TDD architecture that 
achieves spectral efficiencies comparable with "Massive MIMO", with one order of magnitude fewer 
antennas per active user per cell. The proposed architecture is based on a family of network-MIMO 
schemes defined by small clusters of cooperating base stations, zero-forcing multiuser MIMO precoding 
with suitable inter-cluster interference constraints, uplink pilot signals reuse across cells, and frequency 
reuse. The key idea consists of partitioning the users population into geographically determined "bins", 
such that all users in the same bin are statistically equivalent, and use the optimal network-MIMO 
architecture in the family for each bin. A scheduler takes care of serving the different bins on the time- 
frequency slots, in order to maximize a desired network utility function that captures some desired notion 
of fairness. This results in a mixed-mode network-MIMO architecture, where different schemes, each of 
which is optimized for the served user bin, are multiplexed in time-frequency. 

In order to carry out the performance analysis and the optimization of the proposed architecture in a 
clean and computationally efficient way, we consider the large-system regime where the number of users, 
the number of antennas, and the channel coherence block length go to infinity with fixed ratios. The 
performance predicted by the large-system asymptotic analysis matches very well the finite-dimensional 
simulations. Overall, the system spectral efficiency obtained by the proposed architecture is similar to 
that achieved by "Massive MIMO", with a 10-fold reduction in the number of antennas at the base 
stations (roughly, from 500 to 50 antennas). 



I. Introduction 

Multiuser MIMO (MU-MIMO) technology is being intensively studied for the next generation wireless 
cellular systems (e.g., LTE-Advanced |T|)- Schemes where antennas of different Base Stations (BSs) are 
jointly processed by centralized BS controllers are usually referred to as "network-MIMO" architectures 
(e.g., fl2j-[8 J). It is well-known that the improvement obtained from transmit antenna joint processing is 
limited by a "dimensionality" bottleneck (9j-|ll|. In particular, the high-SNR capacity of a single-user 



MIMO system with N t transmit antennas, N r receiving antennas, and fading coherence block length T 
complex dimensions^ scales as C(SNR) = min{iV 4 , N r , T/2} log SNR+0(1) (B), (T4J. Therefore, even 



by pooling all base stations into a single distributed macro-transmitter with JVj S> 1 antennas and all user 
terminals into a single distributed macro-receiver with N r 3> 1 antennas, the system degrees offreedon^j 
are eventually limited by the fading coherence block length T. While this dimensionality bottleneck is 
an inherent fact, emerging from the high-SNR behavior of the capacity of MIMO block-fading channels, 



1 14 1 (see also [15Q the same behavior also characterizes the capacity scaling of MU-MIMO systems 
based on explicit training for channel estimation, and can be interpreted as the effect of the "overhead" 
incurred by pilot signals fT6| ]. 

For frequency-division duplex (FDD) systems, the training overhead required to collect channel state 
information at the transmitters (CSIT) grows linearly with the number of cooperating transmit antennas. 
Such overhead restricts the MU-MIMO benefits that can be harvested with a large number of transmit 
antennas, as shown in [10] using system-level simulation and in [ 1 1 ] using closed-form analysis based 
on the limiting distribution of certain large random matrices. 



For Time Division Duplexing (TDD) systems, exploiting channel reciprocity |17|, [18], the CSIT can 
be obtained from the uplink training. In this case, the pilot signal overhead scales linearly with the 
number of active users per cell, but it is independent of the number of cooperating antennas at the BSs. 
As a result, for a fixed number of users scheduled for transmission, the TDD system performance can 
be significantly improved by increasing the number of BS antennas. 



Following this idea, Marzetta 1 18 1 has shown that simple Linear Single-User BeamForming (LSUBF) 



'The fading coherence block length T, measured in signal complex "dimensions" in the time-frequency domain is proportional 
to the product W C T C , where T c (s) denotes the channel coherence interval, and W c (Hz) denotes the channel coherence bandwidth 

2 The system Degrees of Freedom (DoFs) are defined as the pre-log factor of the system capacity C(SNR), i.e., DoFs = 
limsNR^oo lo SN p , and quantify the number of "equivalent" parallel single-user Gaussian channels, in a first-order approximation 
with respect to logSNR. 



and random user scheduling, without any inter-cell cooperation, yields unprecedented spectral efficiency 
in TDD cellular systems, provided that a sufficiently large number of transmit antennas per active user 
are employed at each BS. This scheme, nicknamed hereafter "Massive MIMO", was analyzed in the limit 
of infinite number of BS antennas per user per cell. In this regime, the effects of Gaussian noise and 
uncorrected inter-cell interference disappear, and that the only remaining impairment is the inter-cell 
interference due to pilot contamination |19j, i.e., to the correlated interference from other cells due to 
users re-using the same pilot signal (see Section [TTT])- 

In this work, we also focus on TDD systems and exploit reciprocity. The main contribution of this 
paper is a novel network-MIMO architecture that achieves spectral efficiencies comparable with "Massive 
MIMO", with a more practical number of BS antennas per active user (one order of magnitude less 
antennas for approximately the same spectral efficiency). As in (T8j, we also analyze the proposed 
system in the limit of a large number of antennas. However, a different system scaling is considered, 
where the number of antennas per active user per cell is finite. This is obtained by letting the number of 
users per cell, the number of antennas per BS, and the channel coherence block length go to infinity, with 



fixed ratios |TTJ. We find that in this regime the LSUBF scheme advocated in [18] performs very poorly. 
In contrast, we consider a family of network-MIMO schemes based on small clusters of cooperating base 
stations, Linear Zero-Forcing BeamForming (LZFBF) with suitable inter-cluster interference constraints, 
uplink pilot signals reuse across cells, and frequency reuse. The key idea consists of partitioning the users 
population into geographically determined "bins", containing statistically equivalent users, and optimizing 
the network-MIMO scheme for each individual bin. Then, users in different bins are scheduled over the 
time-frequency slots, in order to maximize an appropriately chosen network utility function reflecting 
some desired notion of "fairness". The geographic nature of the proposed scheme yields very simple 
system operations, where each time a given bin is scheduled, a subset of active users in the selected 
bin is chosen at random or in a deterministic round robin fashion, without performing any CSIT-based 
user selection. This allows a fast turn-around between feedback and transmission, that can take place in 
the same channel coherence block. The resulting architecture is a mixed-mode network-MIMO, where 
different schemes, each of which is optimized for the served user bin, are multiplexed in time-frequency. 
Using results and tools from the large-system analysis developed in (TTJ, p0| and adapted to the present 
scenario, we obtain the asymptotic achievable rate for each scheme in closed form. The performance 
predicted by the large-system analysis match very well with finite-dimensional simulations, in agreement 
with p"T[ , |2T| and with several well-known works on single-user MIMO in the large antenna regime |22j, 
(231. The large-system analysis developed here is instrumental to the systematic design and optimization 



of the proposed system architecture, since it allows an accurate and rapid selection of the best network- 
MIMO scheme for each user bin without resorting to cumbersome and time-consuming Monte Carlo 
simulation. In fact, the system parameters in the considered family of network-MIMO schemes are 
strongly mutually dependent, and the system optimization without the analytical tools developed here 
would just be infeasible. 

We hasten to say that the ideas of dynamic clustering of cooperating BSs, and multimodal MU-MIMO 



downlink have appeared in a large number of previous works (see for example |24|-[28]. Giving a fair 
account of this vast literature would be impossible within the space limits of this paper. Nevertheless, 
we wish to stress here that the novel contribution of this paper is a systematic approach to multi-modal 
system optimization based on simple closed-form expressions of the spectral efficiency of each network- 
MIMO scheme in the family, and on scheduling across the schemes (or "modes") in order to maximize 
a desired network utility function. 

The remainder of this paper is organized as follows. In Section |nj we describe the family of proposed 
network-MIMO schemes. We discuss the uplink training, MMSE channel estimation and pilot contam- 



ination effect for TDD-based systems in Section III In Section IV we analyze the network-MIMO 



architectures under considerations and and provide expressions for their achievable rate in the large- 
system limit. Scheduling under specific fairness criteria and the corresponding system spectral efficiency 
is presented in Section [V] Numerical results including comparison with finite dimensional simulation 



results are presented in Section VI and concluding remarks are given in Section VII 



II. System model 

The TDD cellular architecture for high-data rate downlink proposed in this work is based on the 
following elements: 

1) A family of network-MIMO schemes, defined in terms of the size and shape of clusters of 
cooperating BSs, pilot reuse across clusters, frequency reuse factor, and downlink linear precoding 
scheme; 

2) A partitioning of the user population into bins, according to their position in the cellular coverage 
area; 

3) The determination of the optimal network-MIMO scheme in the family for each user bin, creating 
an association between user bins and network-MIMO schemes; 

4) Scheduling of the user bins in time-frequency in order to maximize a suitable concave and com- 
ponentwise non-decreasing network utility function of the ergodic user rates. The network utility 



function is chosen in order to reflect some desired notion of fairness (e.g., proportional fairness pTJ, 



1 29 1-| 32 1). When a given bin is scheduled, the associated optimized network-MIMO architecture 



is used. 



Invoking well-known convergence results |22|, [23 1, we use the "large-system analysis" approach for 



multi-antenna cellular systems pioneered in [21 1, [33 1— [35 ]. In particular, we use the results of fTT| , 
which can be easily applied to our system model, and analyze the performance of the network-MIMO 
schemes in the considered family while scaling the number of users in each bin, the number of antennas 
per BS and the small-scale fading coherence block length to infinity, with fixed ratios. We define a system 
size parameter indicated by N, and let all the above quantities scale linearly with N — > oo. Specifically, 
we let MN denote the number of BS antennas, LN denote the channel coherence block length, and UN 
denote the number of users per location (a bin is defined as a set of discrete locations in the cellular 



coverage, see Section II-A I, for given constants M, L and U. 



A. Cellular layout and frequency reuse 

Base stations, cells and clusters: The system geometry is concisely described by using lattices on 



the real line R (for 1-dimensional layouts []8J, |10|) or on the real plane M 2 (for 2-dimensional layouts 
1 18 1). Consider nested lattices A C Ab s C A u in R (resp., R 2 ). The system coverage region is given by 



the Voronoi cell V of A centered at the originjjBSs are located at points b G Ab s n V. The finer lattice 
A u defines a grid of discrete user locations, as explained later in this section. We let B = |Ab s n V| 
denote the number of BSs in the system. 

Example 1: Consider the 1-dimensional layout defined by A = B"L and Ab s = Z. The coverage region 
is V = [-B/2, B/2) and the BS locations b are given by all integer-coordinate points in the interval 
l-B/2,B/2). 

Example 2: In system studies reported in the standardization of 4th generation cellular systems (36}, 
1 37 1 it is customary to consider a 2-dimensional hexagonal layout formed by 19 cells, as shown in Fig. [T] 
In this case, A bs = AZ 2 and A = ABZ 2 , with 



y/3r 
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where r denotes the distance between the center of a small hexagon and one of its vertices. We have 
B = det(B) = 19, and the distance between the closest two points in Ab s is V3r. 



3 The Voronoi cell of a lattice point x G A £ R" is the set of points y £ R closer to x than to any other lattice point. 



For the sake of symmetry, in order to avoid "border effects" at the edges of the coverage region, all 
distances and all spatial coordinates are defined modulo A. The modulo A distance between two points 



u,v in R (resp., IR 2 ) is defined as 



d\(u, v) = \u — v mod A|, (1) 



where x mod A = x — argmin^eA \x — A|. Cell V& is defined as the Voronoi region of BS b G Ab s nV with 
respect to the modulo-A distance, i.e., Vj = {i £ K : d\(x,b) < d^{x,b'), V b' € Ab s n V} (replace R 
with M 2 for the 2-dimensional case). The collection of cells {Vb} forms a partition of V into congruent 
regions. 

A "clustering pattern" u(C), defined by the set of BS locations C = {bo, . . . , bc-i} with bj G Ab s n V 
and rooted at bo = 0, is the collection of BS location sets (referred to as "clusters" in the following) 

u{C) = {{C + c] : ceA bs nV}. (2) 

We focus on systems based on single-cell processing (C = 1), or with joint processing over clusters 
of small size: C = 2 in the 1 -dimensional case, and size C = 3 in the 2-dimensional case, as shown 
in Fig. [2] It turns out that larger clusters do not achieve better performance due to the large training 
overhead incurred, while requiring higher complexity. Therefore, our results are not restrictive in terms 
of cluster size, since they capture the best system parameters configurations. 

Example 3: In the 1-dimensional case of Example [TJ with C = 1 and C = 2, we have 

^({0}) = {{0}, {!},...,{£ -I}}. 

and 

u({0,l}) = {{0,l},{l,2},...,{B-l,0}}, 

respectively. 

User location bins: We assume a uniform user spatial distribution over the coverage region. For the 
sake of analytical simplicity, we discretize the user distribution into a regular grid of user locations, 
corresponding to the points of the lattice translate A u = A u + no, where no ^ is chosen such that A u 
is symmetric with respect to the origin and no points of A u fall on the cell boundaries. 

Example 4: In the 1-dimensional case of Example fll we can choose A u = ^Z, for some even integer 
K, and let uq = ^. Then, the points of A u n Vb are symmetrically located with respect to each BS of 
coordinate b = 0, 1, . . . , B — 1. 



A "user bin" v(X), denned by the set of user locations X = {xo, x\, . . . , x m _i} with x-i G A u , is the 
collection of user location sets (indicated by "groups" in the following) 

v(X) = {{X + c} : ceA bs nV}. (3) 

In particular, we choose X to be a symmetric set of points with respect to the positions of the BSs 
comprising cluster C. The reason for this symmetry is two-fold: on one hand, a symmetric set generalizes 
the single location case and yet provides a set of statistically equivalent users (same set of distances from 
all BSs in the cluster), thus providing a richer system optimization parameter space. On the other hand, 



symmetry yields very simple closed-form expressions in the large-system analysis, by means of [11 Th. 

3]. 

Example 5: In the 1-dimensional case of Example [T| we are interested in the cases X = {—x, x} and 
X = {x, 1 — x}, for some x G A u n [0, 1/2], as shown in Fig. pi This yields the bins 



v({— x, x}) = {{— x,x}, {1 — x, 1 + x}, . . . , {B — 1 — x,B — 1 + x}} 

and 

v({x, 1 - x}) = {{x, 1 - x}, {1 + x, 2 - x}, . . . , {B - 1 + x, B - x}}, 

respectively. 

Cluster/group association and user group rate: The BSs forming a cluster are jointly coordinated by 
a "cluster controller" that collects all relevant channel state information and computes the beamforming 
coefficients for the desired MU-MIMO precoding scheme. For given sets {X, C}, the users in group X + c 
are served by the cluster C + c, for all c G Ab s n V (see Fig. [2]). By construction, each BS belongs to C 
clusters and transmits signals from all the C corresponding cluster controllers. These signals may share 
the same frequency band, or be defined on orthogonal subbands, depending on the system frequency 
reuse factor defined later in this section. There are mUN users in each group X + c, and CMN jointly 
coordinated antennas in each cluster C + c. We assume mil > CM, such that the downlink DoFs are 
always limited by the number of antennasjj The number of users effectively scheduled and served on 
each given slot is denoted by SN. We refer to these users as the "active users", and to the coefficient 
S G [0, CM] as the "loading factor". Depending on the geometry of X and C and on the type of 



beamforming used (see Section IV) S can be optimized for each pair {X,C}. We restrict to consider 



4 A system with rnU < CM is not fully loaded, in the sense that the infrastructure would support potentially a larger number 
of users. 



schemes that serve an equal number SN/m of active users per location x 6 X + c. As anticipated before, 
by symmetry the users in the same bin are statistically equivalent. Therefore, without loss of generality, 
we may assume that a round-robin scheduling picks all subsets of size SN out of the whole mUN users 
in each group with the same fraction of time. In this way, the aggregate spectral efficiency of the group 
(indicated in the following a "group spectral efficiency") is shared evenly among all the users in the 
group. 

Frequency reuse: The frequency reuse factor of the scheme is denoted by F. This can also be optimized 
for each given pair {X,C}. The system bandwidth is partitioned into F subbands of equal width. For 
F = 1, all clusters in u(C) transmit on the whole system bandwidth. For F > 1, clusters are assigned 
different subbands according to a regular reuse pattern. For the 1 -dimensional layout, any integer F 
dividing B is possible. For the 2-dimensional layout, we consider reuse factors given by F = i 2 + ij +j 2 



for non-negative integer i and j |38|. For later use, we define T>(f) as the set of clusters active on 
subband / € {0, . . . ,F - 1}. 

Example 6: Fig. [3] shows a 1 -dimensional system with frequency reuse F = 2 f or the clustering pattern 
of size C = 2 defined by C = {0, 1} and the user bin defined by X = {x, 1 — x}. Even-numbered clusters 
operate on subband and odd-numbered clusters operate on subband 1 . An example for the 2-dimensional 
hexagonal layout with F = 3 and C = 1 is shown in Fig. [TJ where cells with the same color operate on 
the same subband. 

B. Channel statistics and received signal model 

The average received signal power for a user located at x G V from a BS antenna located at 
b e V is denoted by g(x,b), a polynomially decreasing function of the distance d\(x,b). The AWGN 
noise power spectral density is normalized to 1. For fixed For a given clustering pattern u(C) and 
user bin v(X), the fading channel coefficients from the CMN antennas of BS cluster C + c to an 
active user k € {1, . . . ,S/m} at location x + d : x G X, on frequency subband /, form a random 
vector indicated by h k c , c (f;x) G C xl , with circularly-symmetric complex Gaussian entries, i.i.d. 

across the BS antennas, the subbands and the users (independent small-scale Rayleigh fading). In the 
considered network-MIMO schemes, active users are served with equal transmit power equal to 1/5. 
Hence, the total transmit power per cluster is equal to N. Since each BS simultaneously participates in 
C clusters, also the total transmit power per BS is equal to N. Since we consider the limit for N — > oo, 
the channel coefficients are normalized to have variance l/N, such that the received signal power is 
independent of N. This provides the correct scaling of the elements of the random channel matrices in 



order to obtain the large-system limit results. We let the channel vector covariance matrix be given by 
E [h fe)C , )C (/; z)h"cv(/; aO] = w G c'Ax), where G c%c (x) = diag (g{x + c', b + c)I MiV : 6 G C)fl Notice 
that G C ' )C (x) is independent of the user index k and on the subband index /, since the channels are 
identically distributed across subbands and co-located users. 

Under the standard block-fading assumption |TT|-|14}, |18|, the channel vectors are constant on each 



subband for blocks of length LN signal dimensions. Without loss of generality, we assume that these 
coherence blocks also correspond to the scheduling slot. Each slot is partitioned into an uplink training 
phase, of length LpN and a downlink data phase, of length L&N. In this section we deal with the data 



phase, while the training phase is addressed in Section III For the sake of notation simplicity, the slot 
"time" index is omitted: since we care about ergodic (average) rates, only the per-block marginal channel 
statistics matter. The data-bearing signal transmitted by cluster C + con subband / is denoted by 

X C (/) = U C (/)V 6 H (/) (4) 

where the matrix U c (/) G C D contains the codeword (information-bearing) symbols arranged by 

columns. We assume that users' codebooks are drawn from an i.i.d. Gaussian random coding ensemble 
with symbols ~ CJ\f(0, l/S). Achievable rates shall be obtained via the familiar random coding argument 



1 39 1 with respect to this input distribution. The matrix V c (/) G <£ CMNxSN contains the beamforming 
vectors arranged by columns, normalized to have unit norm. It is immediate to verify that, indeed, the aver- 
age transmit power of any cluster C+c, active on frequency /, is given by Tj^ytr (E [X^ (/)X C (/)] ) = N, 
as desired. 

Recalling the definition of T>(f), the received signal for user k at location x + c : x G X is given by 

yfc,c(/;*)= E u c(/)V?(/)h fclCl ^(/;a!) + zj feiC (/;a!) (5) 

cev(f) 

where Zk iC (f;x) ~ CM (0, 4Il d at). Notice that a scheme using frequency reuse F > 1 transmits with 
total cluster power N over a fraction 1/F of the whole system bandwidth. This is taken into account by 
letting the noise variance per component be equal to 1/F, in the signal model pi). 

By construction, the encoded data symbols for user k at location x + c : x G X, are the entries of the 
A:-th column of U c (/). The columns k' ^ k of U c (/) form the intra-cluster (multiuser) interference for 
user k. All other signals U c '(/), with c' G V(f),c' / c, form the Inter-Cluster Interference (ICI). As 

5 We use diag(M a : a G A) to indicate a block-diagonal matrix with diagonal blocks M a , for some index a taking values in 
the ordered set A, and I n to indicate the n x n identity matrix. 



seen in Section IV intra-cluster interference and ICI are handled by a combination of beamforming and 
frequency reuse. 

III. Uplink training and channel estimation 

The CSIT is obtained on a per-slot basis, by letting all the scheduled (i.e., active) users in the slot 
sent pilot signals over the LpiV dimensions dedicated to uplink trainingHWe fix {Af,C} and focus on 
the SN active users in the groups X + c : c G £>(/ )■ These users must send SN orthogonal pilot signals 
to allow channel estimation at their corresponding serving clusters C + c : c E T>(f). 

A. Pilot reuse scheme 

Let L-p = QS, where Q > 1 is an integer pilot reuse factor that can be optimized for each {X,C}. Let 
$ S £Q SNx Q SN b e a SC aled unitary matrix, such that $ H $ = a u \QSNlQSN, where a u \ denotes the 
uplink transmit power per user during the training phase. The columns of <& are partitioned into Q disjoint 
blocks of size SN columns each, denoted by <&o> • • • > &Q-1 ar *d referred to as training codebooks. These 
are assigned to the groups in a periodic fashion, such that the same training codebook <& g is reused every 
Q-th groups X + c : c S T>{f). For later use, we let q(c) E {0, . . . , Q — 1} denote the index of the 
training codebook allocated to group X + c, and define V(q, /) = {c G T>{f) : q(c) = q} as the set of 
clusters active on subband / and using training codebook q. Pilot reuse is akin frequency reuse, but in 
general Q and F may be different in order to allow for additional flexibility in the system optimization. 

Example 7: In the 1-dimensional layout with C = 2, C = {0, 1} and X = {x, 1 — x} we may 
have F = 1 (i.e., each cluster is active on the whole system bandwidth) and Q = 2 (i.e., two mutually 
orthogonal training codebooks are used alternately, such that the same set of uplink pilot signals is reused 
in every other cluster, as shown in Fig. |4]). 

B. MMSE channel estimation and pilot contamination 

The uplink signal received by the CMN antennas of cluster C + c : c € T>{f), during the training 
phase, is given by 

Y c (/)= Y, <f> q (c)K"M-> X ) + Z c(f)- (6) 

cev(f) 



18 



, also our analysis is slightly optimistic since it only accounts for the overhead and degradation due to uplink 
noisy channel estimation, while it assumes genie-aided overhead-free "dedicated training" to support coherent detection during 
data-transmission. As shown in 1161, the effect of noisy dedicated training is minor relatively to the CSIT estimation error. 



10 

Because of TDD reciprocity, the uplink channel matrix H c , c (/; X) G ^CMNxSN con i a [ ns the downlink 
channels h k c , c (f; x) arranged by columns, for all active users k = 1, . . . , SN/m at all locations x + d : 
xEX.ln Q, Z c (/) G c LpXCMAr denotes the uplink AWGN with components ~ CJ\f(0, 1). The goal of 
the uplink training phase is to provide to each cluster C + c an estimate of the channel vectors h fc c c (f; x) 
for all the active users in the corresponding served group X + c. 

By projecting Y c (/) onto the column of <& g ( c ) associated to user k at location x + c : x G X and 
dividing by a u \QSN, the relevant observation for estimating the h k c c (f; x) is given by 

r fc , c (/;^)= XI b*,c ) c(/;a ; )+a* ) c(/) ( 7 ) 

ceV{q(c)J) 
where n fc]C (/) ~ OA/"(0, (a ul QS'iV)- 1 IcA/7v)- For any d G 7>(g(c), /), the MMSE estimate oi\v kdc {f] x) 
from r k c (/; x) is obtained as 

-l 



h feiC/jC (/;x) = G c , jC (x) (a^QSy'lcMN + J2 G c,c(x) 

c"eV(q(c),f) 

Invoking the well-known MMSE decomposition, we can write 



r*,c(/;z) (8) 



hfe, C ',c(/; x ) = hk,c',c(fl x ) + e fc)C / lC (/; a?), (9) 

where the channel estimate h kc , c (f;x) and the error vector e kc , c (/;x) are zero-mean uncorrected 
jointly complex circularly symmetric Gaussian vectors (and therefore statistically independent due to 
joint Gaussianity). After some straightforward algebra (omitted for brevity), we obtain the covariance 
matrices E[h fcc , ]C (/;x)h fcc , )C (/; x)] = ^S cV (x) and E[e fe)CV (/;x)e£ c , jC (/;x)] = ^£ c < jC (x), where 
Sc',c(») = diag(^ jCjfc (/;x)lMAf : 6 G C) and £ c ', c (x) = diag (o- c > jCib (f;x)l M N ■ b G C), and where we 
define 

o-c,c,6(/;a;) = 7- 77-^ (10) 

£c',c,b(f;x) = g(x + d,b + d) -ov )C)6 (/;x) = 77 77 — TT C 11 ) 

with 

(f. x \ = g(x + d,b + c) 

C,C ' 6 ' (a u iQSy 1 + Y, C "eV(q(c)j)\c'9(x + d',b + c) 

The desired channel estimate at cluster C + c is given by h fc c c (/; x), obtained by letting d = c in (8 1 



- (12). Notice that the training phase observation r kc (f;x) in dVb contains the superposition of all the 
channel vectors h k c , c (/; x) of the users k at location x + d : x G X, for all c' G V(q(c), /), i.e., sharing 
the same pilot signal. This is the so-called pilot contamination effect, which is a major limiting factor 
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in the performance of TDD systems [18|, [ |T9j . Because of pilot contamination, the MMSE estimate 
h k c c (f; x) is correlated with the channels h fc c , c (f; x), for all d G V(q(c), /). 

Next, we express the channel vector h fc c , c (f; x) for d G V(q{c) , /) in terms of the estimate h k c c (/; x) 
and a component independent of h fccc (/;x). This decomposition is useful to proof the main results of 



Theorems 1, 2 and 3 in Section |rV-B| and it is the key to understand qualitatively the pilot contamination 



effect. From d8j), and since 5] cc (x) is invertible, we have 

£fc,c, c (/; x) = E [h k>c , )C (f; x) \r k Jf; x)] 

= G cV (x)G- c 1 (x)h fciCiC (/;x). (13) 



Using d9h into ( 13 1, the channel vector h k , (/; x) from the antennas of cluster C + c to the unintended 



user A; at location x + c' : x £ X can be written as 

hfc,c',c(/; x) = G c%c (x)G~ c 1 (a;)h fciC)C (/; x) + ^ c , fi {f\ x). (14) 

Joint Gaussianity, the mutual orthogonality of h k c , c (f; x) and e k c , c (/; x) and the fact that all covariance 
matrices are diagonal imply that h k cc (/; x) and e k c , c (f; x) are mutually independent. 

As anticipated before, ( 14) reveals qualitatively the pilot contamination effect. With LSUBF, as in JT8J, 



cluster C + c serves user k at location x + c with beamforming vector h fc cc (/; x)/||h fc cc (/;x)||, which 
is strongly correlated with the channel vector h kc , c (/;x) towards the unintended user k at location 
x + c', sharing the same pilot signal. It follows that some constant amount of interfering power, that 
does not vanish with N — > oo, is sent in the "spatial direction" of this user, leading to an interference 



limited system, as exactly quantified by Theorem 1 in Section IV-B For the family of LZFBF schemes 
considered in this work, the pilot contamination effect is less intuitive, and it is precisely quantified by 
Theorems 2 and 3 in Section HV-B I 

IV. MU-MIMO Precoders and Achievable Rates 

In the family of network-MIMO schemes considered in this work, the beamforming matrix V c (/) is 
calculated as a function of the estimated channel matrix H cc (/; X). The schemes differ by the type of 
beamforming employed. In particular, we consider LZFBF where any active user k at location x + c : 
x G X, imposes ZF constraints on J > clusters. A ZF constraint consists of the set of linear equations 

vH c ,(/;x , )h fcjCiC ,(/;x) = 0, V (j,x',c')^(k,x,c) (15) 

where Vj :C >(f; x') denotes the column of V c /(/) corresponding to user j at location x' + d : x' G X. 
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A. Beamforming 

Next we provide expressions for the cluster precoders for different choice of the parameter J. 
Case J = 0: In this case no ZF constraints are imposed. Hence, we have 

V c (/) = UNorm {h c , c (/; X)] (16) 

where the operation UNorm{-} indicates a scaling of the columns of the matrix argument such that they 



have unit norm. It is immediate to see that (16 1 coincides with the Linear Single-User Beamforming 



(LSUBF) considered in [18|. 



Case J = 1: In this case any active user imposes ZF constraints on its own serving cluster. This yields 
the classical single-cluster LZFBF, for which 

V c (/) = UNorm {h+ c (/;*)}, (17) 



where 



M+ = M 



M H M 



(18) 



denotes the Moore-Penrose pseudo-inverse of the full column-rank matrix M. It follows that Vk, c {f',x) 
is orthogonal to the estimated channels h • (/; x') for all other active users (j, x') ^ (k, x) in the same 
cluster C + c, i.e., ZF is used to tackle intra-cluster interference, but nothing is done with respect to ICI. 
Case J > 1: In this case, beyond the ZF constraints imposed to the serving cluster, each user imposes 
additional ZF constraints to J — 1 neighboring clusters in order to mitigate the ICI. Mitigating ICI through 
the beamforming design provides an alternative approach to frequency reuse and, in general, might be 
used jointly with frequency reuse. Let's focus on cluster C + c. This is subject to ZF constraints imposed 
by its own users (i.e., users in group X + c), as well as by some users at some locations x' + d : x' E X 
for J — 1 neighboring clusters c' / c. In order to enable such constraints, the c-th cluster controller must 
be able to estimate the channels of these out-of-cluster users. This can be done if these users employ 
training codebooks with indices q / q(c). In particular, J > 1 can be used only if the pilot reuse factor 
Q is larger than 1. In some cases, only the channel subvectors to the nearest BS in the cluster can be 
effectively estimated, since there are other users sharing the same pilot signal that are received with a 
stronger path coefficient. Then, the channel subvectors that cannot be estimated are treated as zero. Since 
these schemes are complicated to explain in full generality, we shall illustrate two specific examples, the 
generalization of which is cumbersome but conceptually straightforward, and can be worked out by the 
reader if interested in other specific cases. 
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Example 8: Consider Fig. 5(a) illustrating a 1 -dimensional system with C = {0, 1}, X = {x, 1 — x}, 
F = 1 and Q = 2. The beamforming matrix of each cluster c satisfies ZF constraints for its own served 
users and for the users in the m = 2 locations at minimum distance in the nearest neighbor clusters, 
c— 1 and c+ 1. These locations collectively use distinct columns of the training codebooks q ^ q(c). In 
the specific example, the reference cluster c = uses training codebook 3>o, an d the nearest locations 
on the left and on the right of cluster use the first SN/2 columns and the second SN/2 columns of 
the other training codebook 3>i, respectively. Hence, cluster controller can estimate all the channels of 
its own active users, at locations x, 1 — x, and of the users in adjacent locations — x and 1 + x, as shown 



in the figure. The beamforming matrix in the case of Fig. 5(a) is obtained as follows. Define 

Mc(/;*)=[Hc, c (/;*) H c _ 1jC (/;{1-x}) H c+1iC (/;{x})1 (19) 

Lv v ' v v ' v v ' J 

2MNxSN 2MNxSN/2 2MNxSN/2 

be the matrix of dimension 2MN x 2SN of all estimated channels at cluster controller c, where the 
first block corresponds to the desired active users and the remaining blocks correspond to users in the 
adjacent clusters for which a ZF constraint is imposed. Then, 

V c (/) = [UNormJM+Cf;*)}]^ (20) 

where [•]" extracts the columns from n to m of the matrix argument. This scheme can be generalized to 

J = Q, where each cluster c satisfies ZF constraints for the desired SN active users in its own cluster 

and for a total of (J — 1)SN additional users in the nearest location of neighboring clusters. 

Example 9: Consider Fig. |5(b) illustrating the same 1 -dimensional system as in Example [8] with 



a different beamforming design. In this case, the beamforming matrix of each cluster c satisfies ZF 
constraints for its own served users and all the users in the nearest neighbor clusters. However, some 
of these users share the same columns of the training codebooks q ^ g(c). In the specific example, the 
reference cluster c = uses training codebook $o> an d the clusters to the left and to the right the other 
training codebook <&i. Users at location — 1 + x use the same pilot signals of users at location 1 + x, and 
users at location — x use the same pilot signals of users at location 2 — x. Then the 0-th cluster controller 
assumes that the channel coefficients for BSs at larger distance are equal to zero. In the example, for 
locations — 1 + x and —x, only the subvector of dimension MN corresponding to the antennas of BS 
is estimated, while the remaining subvector to BS 1 is treated as zero. Similarly, for locations 1 + x and 
2 — x only the subvector of dimension MN corresponding to the antennas of BS 1 is estimated, while 
the remaining subvector to BS is treated as zero. 
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The beamforming matrix corresponding to the scheme of Fig. 5(b) is obtained as follows. Define 



M c (f;X)= H C;C (/;*) H c , c (/; X) ©L H c+1>c (/; X) ©R 



(21) 



2MNxSN 2MNxSN 2MNxSN/2 

be the matrix of dimension 2MN x 3SN of all estimated channels at cluster controller c, where 
indicates elementwise product and where 



1-MNxSN 

Omnxsn 



R 



Omnxsn 
Imnxsn 



are masking matrices that null out the subvectors of the channels that are treated as zero in the beam- 



forming design. Then, V c (/) is again given by (20 1 although in this case M c (/;Af) is given by (21 1 



instead of ( 19 1. This scheme can be generalized to J = C(Q — 1) + 1, where each cluster c satisfies ZF 
constraints for the desired SN active users in its own cluster and for a total of (J — 1)SN additional 
users in the neighboring clusters, with some channel sub-vectors set to zero. (> 



B. Achievable group spectral efficiency 

Letting R kc (f;x) denote the spectral efficiency (in bit/s/Hz) of user k at location x + c : x E X, 
served by cluster c according to a scheme as defined above, we define the group spectral efficiency of 
bin v(X) as 

F-l SN/m 

R*dF, c,j) = ^y, £ EE *£?(/; *) (22) 

/=0 cgA bB nV x&X fc=l 

In Appendices [A] and [Bj we prove the following results. 

Theorem 1: For given sets X,C, and system parameters F, S and Q, in the limit of N — > oo, the 
following group spectral efficiency of bin v(X) is achievable with LSUBF precoding (J = 0): 

S 



R x , c (F,C,J = 0) 



mF 



£log 1 + T 



CM in» 



s io,o^ 



x£X 



wherd_] 



tf*) = ^EEE 



j, + r ,(x) + ^((x) 



£c,c,b(x')g{x,c + b) 






and 



C(x) 



x'eA" 6ec cex>(o) 

v^ _J- / 1 s ^ g(x,c + b) c \ 

Z^ c ( x ) 1 c Z^ ( x ^ WW , 

ceP(0,0)\0 -c,c v ' \ 6eC yv ' ' J 



(23) 



(24) 



(25) 



7 In J24p and |25J it is assumed, without loss of generality, that cluster c — uses subband / = and training codebook 
q = 0. 
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with 

£ cc (*) = 7,EWz)> (26) 



C 

b&C 



are coefficients that depend uniquely on the system geometry, frequency and pilot reuse, but are inde- 
pendent of the loading factor S and of the BS antenna factor M. ■ 
As a corollary of Theorem [TJ we can recover the result of 1 18 1. It is sufficient to let M — > oo in ( 23 1 



and obtain the regime of infinite number of BS antennas per active user. Particularizing this for fixed S, 



C = 1, and Q = 1 as in 1 18 1, the group spectral efficiency becomes 



£SL R ^ F - '■ °> " ^ S> f 1 + EJF 



x&x \ ^ce7»(o,o)\o •' 



As observed in |18|, in this regime the system spectral efficiency is uniquely limited by the ICI due to 
pilot contamination. 

The next result yields the achievable group spectral efficiency of LZFBF in the case of single-cell 
processing (i.e., for C = 1). We define £{x) as the set of J — 1 clusters c/0 with centers closest to 
x G X (if J = 1 then £{x) = 0). Then, we have: 

Theorem 2: For given set X, C = 1 (i.e., C = {0}), and system parameters F, S and Q, in the limit 
of N — y oo, the following group spectral efficiency of bin v(X) is achievable with LZFBF precoding 
(J>1): 




M-JS 



£o,o,o(x) 



«(#) = X] cr o,c ) o(»)+ E ff(s,c) (29) 

c6'P(0,0)U£(a;) ceX'(0)-P(0,0)-£ , (x) 

ceP(o,o)\o vyv ' ;/ 
are coefficients that depend uniquely on the system geometry, frequency and pilot reuse, but are inde- 
pendent of the loading factor S and of the BS antenna factor M. ■ 



where 



and 



In passing, we notice that the limit of (28 1 for M — > oo, coincides with (27 1. Therefore, as observed 
in 1 18 1, in the "Massive MIMO" regime LZFBF yields no advantage over the simpler LSUBF. 

The case of LZFBF with multicell processing (C > 1) needs some more notation. First, as illustrated 
in Examples M and |9j we consider the cases J = 1, J = Q and J = C(Q — 1) + 1, referred to as 
cases (a), (b) and (c), respectively, for the sake of brevity. In case (c) it is useful to define b(x, c) = 
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argmin{(iA(x, c + b) : b € C}, i.e., the closest BS to location x G X in cluster c G £{x). For C > 1, 
an exact asymptotic ICI power expression cannot be found due to the complicated statistical dependence 
of beamforming vectors and channel vectors due to pilot contamination. However, the following result 
yields an achievable rate based on an upper bound on the ICI power (see details in Appendix [B]): 

Theorem 3: For given sets X,C with C > 1, and system parameters F, S and Q, in the limit of 
N — > oo, the following group spectral efficiency of bin v(X) is achievable with LZFBF precoding 
(J>1): 



R X fi{F,C,J>l) 



S 



CM-JS 



mF ^— ' 

xeX 



log 1 + T 



^0,0 ^ 



+ a{x) + ^fp(x) 



where 



a(x) = < 



Y ^.cW + Y ^o, c ( x )' in cases &> and ^ 

ceS(x)L){0} ceV(0)\0-S(x) 

ceX>(o)\o-£(a;) 



and 



with 



+ £, X °'o,c,6(^c)(a ; )+ Y S{x,c + b)\ 

ce£{x) \ beC\b(x,c) J 



ce7>(o,o)\o &ec v yv ' ; 



in case (c), 



\S x "> = c X ^ x ' c + h ^ ^o( x ) = q Y a °> c > b ( 



(31) 



(32) 



(33) 



(34) 



bee bee 

are coefficients that depend uniquely on the system geometry, frequency and pilot reuse, but are inde- 
pendent of the loading factor S and of the BS antenna factor M. ■ 



V. Scheduling and Fairness 

Consider a system with K bins, {v{Xq), . . . ,v(Xk-i), defined by sets X). of symmetric locations 
chosen to uniformly discretize the cellular coverage region V. The net bin spectral efficiency in bit/s/Hz, 
for each bin v{X^), is obtained by maximizing over all possible schemes in the family, i.e., over all 
possible clusters C of size C = 1,2, . . ., frequency reuse factor F, loading factor S, pilot reuse factor 
Q, and beamforming scheme indicated by J, the product 



max{l - QS/L, 0} x R Xk ,c(F, C, J) 



(35) 
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where the first term takes into account the pilot dimensionality overhead, and the second term is the 
spectral efficiency of the data phase for a given network-MIMO scheme, given by Theorems [1] [2] or 
[3] depending on the case. The maximization of ( [35] ) is subject to the constraint JS < CM, which 



becomes relevant for J > (i.e., for LZFBF precoding). Maximizing (35) requires searching over 
a discrete parameter space (apart from S, which is continuous). The simple closed-form expressions 
given in Theorems [T] [2] and [3] allow for an efficient system optimization, avoiding lengthy Monte Carlo 
simulations. 

Suppose that for each bin v(X k ), the best scheme in the family of network-MIMO schemes is found, 



and let R*(X k ) denote the corresponding maximum of (35). Then, a scheduler allocates the different 
bins on the time-frequency slots in order to maximize some desired network utility function of the user 
rates. With randomized or round-robin selection of the active users in each bin, each user in bin v(X k ) 
shares on average an equal fraction of the product p k R k (X k ), where p k is the fraction of time-frequency 
slots allocated to bin v(X k ). Under the assumption that users in the same bin should be treated with 
equal priority, we can focus on the maximization of a componentwise non-decreasing concave network 
utility function of the bin spectral efficiencies, denoted by G{Ro, • • ■ , Rr-i)- The scheduler determines 
the fractions {pk} by solving the following convex problem: 

maximize G(Ro, • • • , Rk-i) 
subject to R k < p k R*(X k ), 

K-l 

Y,Pk< 1 ^ Pk>0- (36) 

fc=0 

For example, if Proportional Fairness (PF) (30j is desired, we have 

A'-l 

G{R ,...,R K - 1 ) = J2 l °g R k, (3V) 

fc=0 

resulting in the bin time-frequency sharing fractions p k = \jK (each bin is given an equal amount of 
slots). In contrast, if the minimum user rate is relevant, we can impose max-min fairness by considering 
the function 

G(R ,...,R K - 1 )= min R k . (38) 

k=0,...,K-l 



This results in the bin time-frequency sharing fractions p k = V K-* ( ;yfc) i — ■ More in general, a whole 



family of scheduling rules including (37) and (38) as special cases is obtained by using the so-called 



a-fairness network utility function, as defined in p9|. 
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VI. Numerical Results and Discussion 

In this section, we present some illustrative numerical results showing the following facts: 1) the 
asymptotic large-system analysis yields a very accurate approximation of the performance (obtained by 
monte Carlo simulation) of actual finite-dimensional systems; 2) the proposed architecture based on 
partitioning the users' population in homogeneous bins and serving each bin with specifically tailored 
network-MIMO scheme provides significant gains with respect to the "Massive MIMO" scheme of [18], 
in the relevant regime of a finite number of BS antennas per active user. 

At this point, it is worthwhile to make a comment on the convergence of finite-dimensional systems to 
the large-system limit as N — > oo. The approach of analyzing multiuser communication systems affected 
by random parameters (such as random channel matrices or random spreading matrices in CDMA) in the 
limit of large dimension in order to exploit the rich, powerful and elegant theory of limiting distributions 



of large random matrices [20] was pioneered in [40 1, |41| in the case of random-spreading CDMA, 



and successfully applied to single-user MIMO channels (see for example [42 1— [45]) and to network- 



MIMO cellular systems 1 11 1, [21 1, |33|-|35|. It was observed experimentally and proved mathematically 
(e.g., see (22J, |23|) that the convergence of the actual finite-dimensional system spectral efficiency to 
the corresponding large-system limit is very fast, as the system dimension N increases. In particular, 
well-known techniques can be used to analyze the "fluctuation" of the quantities of interest around their 
large-system limit for large but finite N. Typically, finite-iV "concentration" results are analogous to the 
Central Limit Theorem for i.i.d. random variables, but the convergence is much faster owing to the fact 
that the eigenvalues of the matrices appearing in the spectral efficiency expressions are strongly correlated 
(see for example the discussion of the results in J23]). Since this convergence analysis is standard but 
cumbersome, and invariably points out that the large-system results are very good predictions of the 
actual performance in cases of practical interest, here we focused only on the limit for N — > oo and 
provided a comparison with finite-dimensional simulation in order to corroborate our claims. 

Fig. [6] shows the group spectral efficiency in ( [35] ) as a function of the bin locations within a cell for 
different schemes identified by the parameters (F, C, J) and Q. The group spectral efficiency is obtained 
by Monte Carlo simulation (dotted) and and is compared against the corresponding values from the 
closed-form large-system analysis (solid), for the 1-dimensional cell layout of Fig. [2] with B = 24 BSs, 
M = 30 antenna factor per BS, L = 40 coherence block dimension factor, and K = 10 bins in each 
cluster, where clusters and location bins are given in Example [3] and [5] with x uniformly distributed 



in [0,1/2]. The pathloss model is the same as in [28|, where g(x,b) = Gq/(1 + (dh(x,b)/5) a ), with 
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Go = 10 6 , a = 3.76, and 5 = 0.05, and reflects (after suitable normalizations) a typical cellular scenario 
with 1km diameter cells in a sub-urban environment. The (1,1,1) scheme with Q = 1 yields the best 
performance for locations near the cell center. However, at the cell edges, C = 2, J = 2, or F = 2 
(not included in the figure) attains significantly better performance. As anticipated above, the limit for 
N — > oo matches very accurately with the Monte Carlo simuation even for very small N (we used the 
minimum possible N = 1 in this case). For this reason, in the following we present only the results for 
the large-system limit, obtained using the closed-form expressions of Theorems [T] [2] and [3] 

In the 2-dimensional case, we considered the layout with B = 19 hexagonal cells as shown in Fig. [1] 



For comparison, we assume the same system model as in 1 18 1, with channel coherence block dimension, 
the cell radius, and pathloss model given by Lp = 84, 1.6 km, and g(x, b) in the same form as before, 
with parameters Go = 10 6 , 5 = 0.1 km, and a = 3.8, respectively. Log-normal shadowing, considered 



in 1 18 1, is not considered here (see the comment in Section [VlT]). We considered schemes with cluster 



size G = 1 and G = 3, K = 16 bins with 48 user locations, where the cluster and bin layout are 
qualitatively described in Fig. [2] The frequency reuse factor F and pilot reuse factor Q are selected 
between 1 and 3 and, when F or Q = 3, the frequency subbands or training codebooks are allocated to 
clusters as shown in Fig. [T] where different colors denotes different subbands or training codebooks. Fig. [7] 
illustrates the optimum over the family of network-MIMO schemes for (a) M = 20 and (b) M = 100. 
In both cases, (1, 1, 1) is optimal in the inner part of the cell, but schemes with (3, 3, 1) or (3, 1, 1) yield 
better performance for locations near cell boundary. We notice also that the inner area within which the 
(1,1,1) scheme is the best increases with the BS antenna factor M. 

Next, we compare the performance of the proposed architecture with the one advocated in p8| . Fig. [8] 
shows the bin-optimized spectral efficiency normalized by the spectral efficiency of (1, 1,0), Q = 1 
scheme (corresponding to [18]), under two-dimensional layout with M = 50. The gain of the proposed 
architecture ranges from about 40% to 580%, depending on the users' location. Fig. [9] shows the 
system throughput as a function of M in the two-dimensional layout. The throughput obtained for fixed 
parameters in the considered family of network-MIMO schemes, as well as for the bin-optimized mixed- 
mode letting the scheduler choose the bin and the associate network-MIMO scheme as described in 
Section Mis shown, and compared with the reference performance of the (1,1,0),Q = 1 scheme. The 
cluster scheme includes two cases where the cluster pattern is fixed as one of two shown in Fig. [2] or 
can be switched to the closest one depending on the user locations. The system throughput of Fig. [9] is 
obtained under PF scheduling (see ([37])). For the sake of comparison, we assumed 20 MHz bandwidth 



and the coherence block size L = 84 as in [18] (considering the parameters of 3GPP LTE TDD system). 
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As the figure reveals, the (3, 3, 1) schemes perform very well for small M < 20 while, as M increases, 
the (1, 1, 1) scheme is best. The bin-optimized architecture improves the throughput further at any value 
of M. The dotted horizontal line in Fig. [9] denotes the cell throughput claimed in [18] in the limit of an 
infinite number of transmit antennas per user with the (1, 1, 0), Q = 1 scheme. We notice that this limit 
can be approached very slowly, and more than 10000 antennas per BS are required (clearly impractical). 
For finite number of antennas, the proposed architecture achieves the same throughput of the scheme in 



1 18 1 with a 10-fold reduction in the number of antennas at the base stations (roughly, from 500 to 50 



antennas, as indicated by the arrow). 



VII. Conclusions 



We studied a novel network-MIMO TDD architecture that achieves spectral efficiencies comparable 
with the recently proposed "Massive MIMO" scheme, with one order of magnitude less antennas per 
active user per cell. The proposed strategy operates by partitioning the users population into geographically 
determined "bins". The time-frequency scheduling slots are allocated to the bins in order to form 
independent MU-MIMO transmissions, each of which is optimized for the corresponding bin. This 
strategy allows the uplink training reuse factor, the frequency reuse factor, the active user loading factor, 
the BS cooperative cluster size and the type of MU-MIMO linear beamforming to be finely tailored 
to the particular user bin. We considered system optimization over 1 -dimensional and 2-dimensional 
cell layouts, based on a family of network-MIMO schemes ranging from single-cell processing to joint 
processing over clusters of coordinated BSs, with linear precoders ranging from conventional linear single- 
user beamforming to zero-forcing beamforming with additional zero-forcing constraints for neighboring 
cells. In order to carry out the system optimization, we developed efficient closed-form expressions for 
the achievable spectral efficiency for each scheme in the family and each bin in the cellular layout. 
Our closed-form analysis is based on the large-system limit, where all system dimensions scale to 
infinity with fixed ratios, and make use of recent results (by some of the authors of this paper) on 
the analysis of cellular systems with linear zero-forcing beamforming and channel estimation errors 



1 1 1 1 . The performance predicted by the large-system asymptotic analysis is shown to match very well 
with finite-dimensional simulations. Our numerical results show that different schemes in the considered 
family achieve the best spectral efficiency at different user locations. This suggests the need for a location- 
adaptive scheme selection to serve efficiently the whole coverage region. The resulting overall system 
is therefore a "mixed-mode" network-MIMO architecture, where different schemes, each of which is 
optimized for the corresponding user bin, are multiplexed in the time-frequency plane. 
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As a final remark, it is worthwhile to point out that the approach of partitioning the users in homoge- 
neous sets, serving each set according to a specifically optimized scheme, and using a scheduler to multiple 
different schemes in order to maximize some desired network utility function, can be generalized to the 
case of shadowing, and to the case of users with different mobility. This generalization is, however, non- 
trivial. For example, in the presence of slow frequency-flat shadowing, "bins" are no-longer uniquely 
determined by the users geographic position. Rather, the set of large-scale channel gains (including 
shadowing), should be used to classify the users in equivalence classes. Also, in the presence of users 
with different mobility, users should be classified also on the basis of their different channel coherence 
block length. The issue of how to optimally cluster users into equivalence classes that can be efficiently 
served in parallel, by MU-MIMO spatial multiplexing, represents an interesting and important problem 
for future work. 

Appendix A 
Proof of TheoremQ] 

We focus on the reference cluster C (i.e., c = 0), with corresponding served group of locations 
X = {xo, . . . ,x m -i}. For the sake of notation simplicity, we omit the subchannel index /, and let V 
denote the set of clusters active on the same subchannel of cluster 0, and V denote the set of clusters that 
share the same pilot block as cluster 0. From ([5]), the (scalar) signal received at some symbol interval of 
the data phase, at the /c-th active user receiver at location x £ X, is given by 

Vk,o( x ) = u M(£)vfc )0 (x)h fcA0 (x) (39a) 

+ ^2 u jA x ) v ^o( x )^k,o,o( x ) + Yl Y u Jfi(x')v^ (x')h kfi0 (x) (39b) 

j^k x'eX\x j 

+ Y, YJ2 u ^'( x >^( x ')hk,o,c( x ) + z kfi (x), (39c) 

c>ev\ox>ex j 

where Uj )C >(x') denotes the code symbol transmitted by cluster d , to user j at location x' + d : x' E X. 

With LSUBF downlink precoding, we have 



(x') 



h ? , c , c ,(x') h ? , c , c ,(*') (40) 



-3,C,e 



Using the MMSE decomposition d9b, we isolate the useful signal term from (39ai, given by, 



«fc,o(a;)vJJo(a;)Efe,o,o( aj )- ( 41 ) 



22 



The sum of the residual self-interference term due to the channel estimation error with the signals in 



(39b I transmitted by cluster to the other users, results in the intra-cluster interference term 



u k,o(x)^ (x)e kfifi (x) + ^u jfi (x)v\j [ (x)h kfi0 (x)+ ^ ^2u jfi (x')v^ (x')h kfi0 (x). (42) 

j+k x'EX\x j 



Finally, the ICI term and background noise are given in (39c I. 



A standard achievability bound based on the worst-case uncorrelated additive noise [15| yields that 
the achievable rate 



? (N) 



JO*)=E 



E 



log 1 + 



(useful signal term| 2 | h k0i 



E 



|noise plus interference term| 2 | h fe00 ( 



(43) 



Both numerator and denominator of the Signal-to-Interference plus Noise Ratio (SINR) appearing inside 



the log in (43 1 converge to deterministic limits as N — > oo. We will use extensively the representation 
of the channel MMSE estimates as 



-],c,c 



,{x') 



1 



.1/2 



N 



' \ x ) a j,e,c' \ x ) 



(44) 



where the vectors aj )C)C '(a/) are i.i.d. ~ CA/"(0, Icmn), with generic components denoted by {a n ^ '■ n = 
1, . . . ,MN} for all b G C. We will also make use of the following limit, which follows as a direct 
application of the strong law of large numbers: 






bee 



N 



MN 



^ \a n ,t 



1 2 a -s- 



CMC Jx') 



n=l 



where £ ,(a/) is defined in (26 1. Using (40l, the SINR numerator is given by 

2 ^ 



E 



«fc,o(»)vfc )0 (a?)h fco ,o(a ; ) 



h 



fe,0,0 



(x) 



E 



\uk,o(x)\' 



ifc,0,0 



(x) 



h 



k,0,0\ 



1 

S 

a.s. CM 



hk,o,o( x ) 



S -o.o 



(45) 

(46a) 

(46b) 
(46c) 



where in (46a) we used the LSUBF definition (40). 



Next, we notice that all the terms forming interference and noise are uncorrelated. Hence, the condi- 
tional average interference power can be calculated as a sum of individual terms. The self-interference 
due to non-ideal CSIT is given by 



E 



u k,o( x ) v k,o(x)e kflfi (x) 



iifc,o,o( x ) 



1 „<* , ,„_ 2r H 



-gjy\\hk,o,o( x )\\ hfc,o,o( x ) s o,o(a;)h fci o,o( a: ) 

1 EfcgC £o,0,b( x ) a Ofl,b(.x)-k EniT \ a n,b\ 2 

SN CMZ 00 (x) 





(47) 
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where (47 1 follows from noticing that 



MN 



^2^o,o,b{x)a 0fijb (x)— ^ \ a n,b\ 2 ^ M ^2^o,o,b{ x ) a o,o,b( x )^ 



bee 



n=l 



bee 



which is a finite constant. 

Following very similar calculations (omitted for brevity) and recalling that g(x, b) = Co,o,b{x)+<Tofl,b(x) 



(see (11)) and that SN/m users per location x' G X are active, we obtain the intra-cluster interference 



power terms as 



^EE 



€o,o,b(x')g(x,b) 



mC 



(48) 



e (t ; ) 

x'exbec -o,o^ > 
Next, we consider the ICI power term. In doing so, we must pay attention to the pilot contamination 



effect. In particular, we have to separate all contributions in ( 39c I coming from the A;-th beam of clusters 
d £ V (i.e., for users sharing the same pilot signal of the reference user k at x G X), from the rest. The 
two contributions to the ICI are 



-^same pilot — z_^ u k,c>(x)v kc ,(x)h k0c ,(a 
ceV\o 



(49) 



and 



^•no same pilot — Z^ 2-^ u j,c'(x)v jc ,(x)h k0c ,(x) + ^ ^ / „ u j,c {% ) Vj, c ' ( x )hfc,o,c ( 3 

ceV\oj^k c'eT\0x<ex\x j 



dev-v x'ex j 



(50) 



Both 2^ same pilot and I no same pilot Me independent of h k00 (x). Therefore, conditioning in the ex- 
pectation can be omitted. Each individual term appearing in the sum ([50]) yields 



iVE 



u j,c>(x)v jc ,(x')h kfic ,(x) 



1 ^tr(a c > |C >(x / )G , c >(x)) 
S CMi_ clc ,{x') 

1 sp tc>,c, b{x')g(x,c' + b) 



SC 



bee 



£, ,(x') 



Summing over all terms, we have 



E 



In order to evaluate E 



-no same pilot I 



IT ■, I 2 

l-^same pilot I 



1 



Cc',c',b(x')g(x,c' + b) 



E 1 V"^ V^ Sc^i 
m c 2^ Z^ c ( x i\ 

c'ev\o x'exbec 2-c,c> y > 



(51) 



, we use the decomposition (14) applied to h kQc ,(x), namely, 

(52) 



h k ,o,c'(x) = G ,c(x)G c , 1 c ,(x)h A . iC , iC ,(x) + e fc)0iC ,(x). 



2 1 



The general term in (091) yields 



E 



Uk,c'(x)v^ c ,(x)h kAc ,(x 

1 r ^ /^h ^ 

^ E [\\hk,c',c' (x)\\~ 2 (hk,c',c (x)Go,c {x)G~, l d (x)h kc 

+ llfc,c',c' ( x )§fc,0,C ( x )§fc,0,c' ( x )iifc,c',c' ( x ) 

M ( g{x,d + b) t V 



where, inside the expectation, we used the a.s. limits d45]>, 



£fc, C ',c 0)Go,c Oc)G c A, (x)h k , , (x) 



MN 



y^ g(x,d + b) nJLv^i 



fcec 



g(c' + x, d + 6) 



n=l 



ME%^e c -^w, 



6ec 



g(x,b) 



with g(c' + x, c' + b) = g(x, b), and h k c , c ,(x)e k c /(x) — >' 0, and the limit 



E 



h k ,c',c (x)e kfic , (x)eg 0>c , (x)h fcc , c , (x) 



-E 



N 
1 

iV2 



fe*,c',c'( aj ) S 0,c'(»)fe*,c' ) c'( ar ) 



tr(H c%c ,(x)S , C '(x)) ^0 



Summing over all such terms, we obtain 

CM 



E 



IT I 2 

I same pilot I 



E 



'E^^w 



ceP\o-c,c v ' \ bee av ' ; 



(53) 



(54) 



(55) 



Using (46ci, (48l, (51) and (55l in (43 1, recalling that the noise variance is equal to 1/F, summing over 



all users in the reference group X and observing that the system is symmetric (by construction) with 
respect to any cluster and any subband, we find the normalized group spectral efficiency of bin v{X) in 



the form (23 1. 



Appendix B 
Proof of Theorems |2] and [3] 



With reference to Section |IV-A[ we consider LZFBF with or without inter-cluster interference (ICI) 
constraints, depending on the value of J > 1. In particular, each cluster creates JSN beamforming 
vectors for SN users in the same cluster and (J — \)SN users in the neighboring clusters. Any active 



user in the system, at any given scheduling slot, imposes ZF constraints (see (15)) to J clusters. We 



restrict our attention to the three cases treated in Section IV- A again referred to as cases (a) J = 1; (b) 
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J = Q; and (c) J = (Q — 1)C + 1. As before, we focus on the reference cluster C (c = 0) with served 
group X. In such cases, the beamforming matrix Vq is given by the column-normalized Moore-Penrose 



pseudo-inverse of the estimated channel matrix (see (17i, (19) and (21)), of size CMN x JSN. This 
matrix is formed by blocks of size N x SN/m, in the form: 

M ,o M ,i ••• M ,j m -i 

Mi, Mi,i ••• M 1]Jm _i 



M 



(56) 



McM-1,0 M-CM -1,1 ■ ■ ■ McM-l,Jm-l 
where each block My corresponds to the MN antennas of BS b G C and to the SN/m active users 
in some location x' with respect to which ZF constraints are imposed. For the purpose of analysis, it is 
important to notice that the blocks My are mutually independent, and each block contains i.i.d. elements 
with mean zero and variance that depends on the block. For example, if block My corresponds to a user 
location d + x' : x' G X and BS b G C such that the corresponding channel vectors are estimated from 



the uplink training phase, the elements of My are ~ CN(0,£, C 'fi,b(,x)/N) (see (111 in Section III-B I. 
Instead, if block My corresponds to a user location and a BS such that the corresponding channel 



vectors are treated as zero (see Section IV-A Example |9l), then My = (all- zero block) 



The signal received by user k at location x G X takes on the form (39). From |11 Theorem 3] the 
rate 



j(N) 



iO*) = E 



E 



log 1 + 



| useful signal term| | Vkfl(x),h 



k,0,0\ 



E 



(57) 



|noise plus interference term| 2 | Vkfi(x),h k00 (x) 
is achievable, assuming that the receiver has perfect knowledge of its own estimated channel and 
beamforming vector. The large-system limit of the LZFBF useful signal coefficient v^ (x)h fc 00 (x) 



for channel matrices in the form (56) was obtained in 1 11 Theorem 1] (details are omitted for the sake 



of brevity). While in general this limit is obtained as the solution of a fixed-point equation that must be 
solved numerically, the user locations and the BS positions considered in this paper satisfy the symmetry 



conditions given in [11 Section III. A], and the asymptotic useful signal term admits a simple closed 



form given in pT| eq. (32)]. Applying this result we obtain 



^A x ')%,c'c'{ x ') 



(CM-JSVAx') 



(58) 



for any j, c'Abs n V and x' G X. By construction, it is assumed that JS < CM. Notice the well-known 
dimensionality limit of the ZF beamforming: when the ratio of the number of ZF constraints per degree 
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of freedom (antenna) JS/(CM) tends to 1, the effective useful signal term vanishes. Using (58 1 and 
recalling that E[|tifco( a; )| 2 ] = 1/S we obtain the SINR numerator in (57 1 as 

2 - 1 . - CM - JS 



E 



Uk,o(x)v ko (x)h koo (x) 



v kjQ (x),h k0fi (x) 



S 



L W- 



(59) 



As done in Appendix |Al we consider the intra-cluster, ICI and noise terms in the SINR denominator of 



(57 1 separately. The ZF constraints imply that v^ (x')h_ fc 00 (x) = for all (j, x') / (k,x) : x 1 € X. 



Therefore, the intra-cluster interference term given in general by (42 1, reduces to 



X ^ n i,o(x / )v^ (x / )e fcoo (j;), 
x'ex j 

and its conditional second moment is given by 

2 



E 



Y. x >£xY.j u j,o( x ') v "o( x ')ek,o,o( 



v fc) o(x),h fc)0i0 (a;) 



1. 



E 



5 
SN 



tr(V{/E 



5fc,o,o( x )§fc,o,o( 



V f 



v/,o(,xj 



^E[tr(v V H S ,o(x))|v M (x) 



(60) 



bee 



where the last line holds from the following lemma 

Lemma 1: If the user locations and BS positions are symmetric (in the sense defined in fTTj Section 
III. A], which is satisfied for the choice of lattice-based user locations sets considered in this work), the 
matrix V C V^ satisfies the "constant partial trace" property in the large-system limit, i.e., the sum of 
a block of MN consecutive diagonal elements of V C V^ corresponding to the antennas of BS b G C, 
divided by SN, tends to the constant limit 1/C, independent of the BS index b. 

Proof: The sum of the 6-th diagonal element block of size MN of V c Vjj 1 , divided by SN, is written 
as 



1 

5iV 



bMN 

E 

<=(ii-l)MiV+l 



* C * c 



1 

SN 



bMN 



SN/m 



X! EE lKc(*)y , 



(61) 



l={b-l)MN+l x&X j=l 

where [vj jC (x)]g denotes the £-th element of the column Vj jC (x) of V c . Next, for the sake of clarity, we 
identify some terms in the notation of this paper with the corresponding terms in the notation of [ fTT] 
Lemma 1]. To this purpose, we enumerate the locations x G X as k = 1, . . . , m. The transmit power to 



8 Notice that since the columns of Vo have unit norm we have -^tr(VoVo) = ^tr(Vo Vo) = S. However, since So,o(i) 
is block-diagonal with constant diagonal blocks cro,o,b(x)lMN, the constant partial trace property is needed in order to obtain 

1 50 1. 
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each active user in the cluster is qu = l/S and the fraction of active users per location is [i k = S/m. 
Both quantities are constant with k. Also, the term 

1 



N 



|2 



bMN SN/m 

J2 2 |[ v j,c(z)L 

£=(&-l)M7V+l i=l 
coincides with the term 9^ k defined in JTT] , eq. (26)] Therefore, the term in (61 1 can be written as 

h TJk=i e b,k = YT=i Qkh,k- Applying |TTj Lemma 1], we have 

m rn/C 

/ J Qk6b,k = 2_^ Ik/J-k 
k=l k=l 



mlS 1 
C ' S' m = C' 



Next, we consider the ICI term and we separate it into % same D jj ot and X no same D ji f The condi- 
tioning with respect to Vk,o(x), ^k,o,o( x ) is irrelevant for the ICI terms and therefore it can be omitted. 



First, we evaluate the pilot contamination effect for the case C = 1. Using (52i, (59 1 and (49), we obtain 

2"i 



E 



IT" 

l-^same pilot I 



E 



Y^ u k,c(x)v" c ,(x)h kAc 



c'ev\o 



g Yl E \ v k,c'( x ){ G o,c'(x)G c ,] cl (x)h kcl>c ,(x) + e kfic ,(x] 



c'&V\0 

c'e-p\o 
M-JS 



S 



g{x,c'Y ' 



g(x,0) 



E 



g(x,c') 



S ^ \g(x,0) 



E 



v k,c'( X )hk,c>,c>(x) 



C, ,(x) 



+ E 



v k,c'( x hk,0,c'(x) 



(62) 



where we used ( |58j ) and 

E 



v k,c'( x hk,0,c'(. x ) 



N 

1 



-E 



Vfe, c /(x)S!o,c'(^)vA : ,c'(a ; ) 



= — max{cr , C ',b(a;)} -> 
iV feec 



(63) 



For C > 1, we have G Qtd {x)G-} c ,{x) = diag ( g( Y'+ 6) I M iV :bec). While v£ c ,(x) and h fcjC , iC ,(x 



are orthogonal by design, the term E 



v fc. C ' (^) G o, C ' (x)G c , 1 c , (x)h k , , (x) 



is generally non-zero and 



does not admit a simple closed-form since v& ]C / (x) and h k c , c , (x) are statistically dependent. In order to 
overcome this problem, we consider the following upper bound obtained by applying Cauchy-Schwartz 
inequality: 



v k,c>(x)G ,c>(x)G c } c ,(x)h k , Ax) < ||v fc)C /(x)|| 



»o,c 



( x )GA,( x )h k ,c'.c'( x ) 



(64) 
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Recalling that Vfe ;C '(x) has unit norm and that E[h fc c , c ,(x)h k c , c , (a 



n" 



S c ' ;C '(x), we obtain 



E 



It i 

l x same pilot I 



cm v 1 ( gjx^ + by f 
c'gp\o feec 



(65) 



g(x,b) 

Next, we examine the ICI power caused by the term X no same pilot- I n tne case of J > 2, this can 
be further decomposed into a term 2jQ-ZF' taking into account the clusters which have a ZF constraint 
with respect to user k at location x € X, and ^no-ICI-ZF' taking into account all other clusters. In order 
to proceed, we define £{x) as the set of J — 1 clusters c/0 with centers closest to x € X. With these 
definition, we have 

X ICI-ZF = Y^ J2^2 u J,c( x ') v f,c'( x, )hk,o,c( x ) ( 66 ) 

c'e£(x) x'ex j 

and 

X no-ICI-ZF = Yl ^ n J>'(x)v^ c ,(a;)h A , i0iC ,(3;)+ ^ ^ ^2u jiCr (x')v^ c ,(x')h kfi)C ,(x) 
cev\oj^k d <=v\o x' ex\x j 



+ £ E 5Z«i,c(»')vJ c /(a / )h fci o jC '(a!). 



(67) 



We start with the terms in ( 67 1. For d G 7^0, by definition of LZFBF we have that v^ c , (x')h k c , c , (x) = 



for all (j,x') ^ (k,x). For C = 1, since Go, C '(x)G c , 1 c , (x) is a scaled identity matrix, using (52i we 
have that 

v" c , (V )hfc,o,c (x) = v£ c , (x')e k0c , (x) (68) 

For c' G V — V — £(x), the vectors vl^x') and h fc0 c ,(x) are statistically independent. Hence, for C = 1 
we have 



lim E 
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7V-s>oo 



l X no-ICI-ZFl 



^c'e-p\o 



V C ,V C H ,£ , C '(* 
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c'ei5-'P-£(x) 



SJV 



E 



trfv^V^Go.c^x) 



(69) 



c'e"P\o c'eT>-vs(x) 

where in (69 1 we used Lemma n\ for matrix V c <V|j. 

For C > 1, because of the block-diagonal form of the matrix Go^ c '(x)G~, 1 c ,(x) already mentioned 
before, ( [68] ) does not hold in general. An upper bound to the interference power in this case can be 
obtained by assuming that the MMSE estimate h k c , (x) of the channel from user k at location x G X 
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and the antennas of cluster d is so noisy that it can be considered equal to zero. Therefore, the estimation 
error e k ^^{x) nas covariance iGo, c '(s), and we obtain 

|2 



lim E 



Ko. 



ICI-ZFI 



s E 

c / eP\0 - £(cc) 



So.c'W 



(70) 



where 5 ,(x) is defined in (34). Finally, we consider 2jq_zf in (66). We distinguish different cases 



depending on the value of J. In case (a), this term is zero. In case (b), we have v^ c , (x')h k d (x) = 



for all j and all x G £{x). Hence, similarly to (60), we obtain 

E 



TCI-ZFl 



[x 



(71) 



c'e£(x) 
In case (c), the ZF vectors Vj jC -(x / ) of cluster d G £(a;) are calculated by imposing orthogonality 
conditions with the segment of the estimated channel vector h k c , (x) corresponding to the MN antennas 
of the closest BS. In order to proceed further, we define the index of the closest BS to location x in 
cluster d G £(x) as b(x, c) = argmin{c(A.(a?) c + 6) : 6 G C}. Then, the effective channel used for ZF 
beamforming calculation is given by 



ifc,0,c' 



*6(*,c')fe*,0,c'( a O 



where S^^gA is a selection matrix, with all elements equal to zero but for a block of diagonal elements 
corresponding to the positions of the MN antennas of BS b(x, d). By construction, and using the MMSE 
decomposition, we have 



Vj, c >(x')h kAc ,(x) = vj c ,(x / ) (ty b ( Xfi ,)h kfic ,(x) + {Icmn - & b ( x , c >))hk >0iCl (x) + e kfi , C '(a 

= v ^,c'( x> ) {(ICMN ~ & b ( XtC r ) )h kfi)C ,(x) + &b(x,c>) e k,0,c'(x)) 

= v " c ,(x')e ki o,c>(x) 



(72) 



where e k o,c'( x ) * s independent of all beamfomrming vectors V c / of cluster d G £{x), and has covariance 
matrix 
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((I 



CMN 



^b(x,c'))Go,c'(x)(IcMN - *b(x,c')) + ^b(x,c')^0,c'{x)^fb(x,c')) 



Using these facts and operating similarly as in (71 1, we obtain 
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^2 g{x,b + c')+ a ,c>Mx,c>) (x) 



(73) 



.beC\b(x,c') 
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From (59 1, (60 1, (62), (69 1, (71 1, and (73 1, the normalized group spectral efficiency for C = 1 and J > 1 



is obtained in the form ( [28] ) For the cluster case C > 1, using bounds ( [65] ), and (70 1, we obtain the 
achievable normalized group spectral efficiency given by (pl])p] 



A lower bound to an achievable rate is also achievable. 
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Fig. 1. Two dimensional hexagonal cell layout with B — 19. The triangle marks indicate the BS positions (points of Ab s ) and 
the red triangle marks indicate the points of A. The insides of the large thick-lined hexagon and the small hexagons denote V 
and Vi, for b 6 Abs, respectively. 
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Fig. 2. Cluster pattern geometry and user bins in one-dimensional and two dimensional layouts. 
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Fig. 3. 1 -dimensional layout with C = 2 and F = 2. 
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Fig. 4. Pilot reuse and contamination for C — 2, F — 1, and Q = 2. The dashed lines show the contamination from a user 
sharing the same pilot signal, in another cluster. 
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Fig. 5. Two cases of a precoding scheme for C = 2, F — 1, and Q = 2, with J = Q (a) and J = C(Q — 1) + 1 (b). 
The dashed lines indicate the channel vectors to out-of-cluster users for which a ZF constraint is imposed. In Figure (b), the 
light-shaded dashed lines indicate the channel vectors assumed zero in the beamforming calculation. 
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Fig. 6. Bin spectral efficiency vs. location within a cell obtained from the large system analysis (solid) and the finite dimension 
(N = 1) simulation (dotted) for various (F, C, J). M = 30 and L = 40. 
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Fig. 7. Optimal scheme at each user locations. M = 20 and 100, K = 16, and L = 84. 
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Fig. 8. Bin-optimized spectral efficiencies normalized by the (1,1,0) spectral efficiencies. M — 50, K — 48,and L — 84. 
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Fig. 9. Cluster sum throughput vs. M for various (F, C, J) and for a bin-optimized architecture under PF scheduling. K = 48 
and L — 84. The arrow indicates that the proposed architecture achieves the same spectral efficiency as the fixed scheme (1,1,0) 
of 1181, with a 10-fold reduction of the number of BS antennas. 



